Exploring Abbreviation Expansion for Genomic Information Retrieval
نویسندگان
چکیده
Abbreviations are commonly found instances of synonymy in Biomedical journal papers. Information retrieval systems that index paragraphs rather than full-text articles are more susceptible to term variation of this kind, since abbreviations are typically only defined once at the beginning of the text. One solution to this problem is to expand the user query automatically with all possible abbreviation instances for each query term. In this paper, we compare the effectiveness of two abbreviation expansion techniques on the TREC 2006 Genomics Track queries and collection. Our results show that for highly ambiguous abbreviations the query collocation effect isn’t strong enough to deter the retrieval of erroneous passages. We conclude that full-text abbreviation resolution prior to passage indexing is the most appropriate approach to this problem.
منابع مشابه
Extracting Useful Information from Clinical Notes
A new type of query, i.e., note query, which contains plentiful information of the patients, is given in this year’s CDS track. Previous results suggest that the additional information in the query may not lead to a better retrieval performance. Therefore, we proposed a method to extract important information from the clinical notes for retrieval. In addition, we also explored the expansion alg...
متن کاملQEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...
متن کاملYork University at TREC 2009: Chemical Track
Our chemical experiments mainly focus on addressing three major problems in two chemical information retrieval tasks, Technology Survey (TS) task and Prior Art (PA) task. The three problems are: (1) how to deal with chemical terminology synonyms? (2) how to deal with chemical terminology abbreviation? (3) how to deal with long queries in Prior Art (PA) task? In particular, we propose a query ex...
متن کاملRePaLi Participation to CLEF eHealth IR Challenge 2014: Leveraging Term Variation
This paper describes the participation of RePaLi, a team composed with members of IRISA, LIMSI and STL, to the biomedical information retrieval challenge proposed in the framework of CLEF eHealth. For this first participation, our approach relies on a state-of-theart IR system called Indri, based on statistical language modeling, and on semantic resources. The purpose of semantic resources and ...
متن کاملA Combined Query Expansion Approach for Information Retrieval
In this paper we aim to contribute to meeting the information access needs of biology researchers who wish to retrieve abstracts from MEDLINE. The volume of research papers is growing daily due to factors such as increased research into the online database the genome and also a world-wide move to migrate information to online sources. We consider that one way to do this is by exploring the comb...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007